A two-dimensional gel database of rat liver proteins
useful in gene regulation and drug effects studies 

A standard two-dimensional (2-D) protein map of Fischer 344 rat liver 
(F344MST3) is presented, with a tabular listing of more than 1200 protein species. 
Sodium dodecyl sulfate (SDS) molecular mass and isoelectric point have been es- 
tablished, based on positions of numerous internal standards. This map has been 
used to connect and compare hundreds of 2-D gels of rat liver samples from a va- 
riety of studies, and forms the nucleus of an expanding database describing rat 
liver proteins and their regulation by various drugs and toxic agents. An example 
of such a study, involving regulation of cholesterol synthesis by cholesterol-lower- 
ing drugs and a high-cholesterol diet, is presented. Since the map has been ob- 
tained with a widely used and highly reproducible 2-D gel system (the Iso-Dalt® 
system), it can be directly related to an expanding body of work in other laborato- 
ries. 
1 Introduction

High-resolution two-dimensional electrophoresis of pro- 
teins, introduced in 1975 by O'Farrell and others [1-4], has 
been used over the ensuing 16 years to examine a wide va- 
riety of biological systems, the results appearing in more 
than 5000 published papers. With the advent of computer- 
ized systems for analyzing two-dimensional (2-D) gel ima- 
ges and constructing spot databases, it is also possible to 
plan and assemble integrated bodies of information de- 
scribing the appearance and regulation of thousands of pro- 
tein gene products [5, 6]. Creating such databases involves 
amassing and organizing quantitative data from thousands 
of 2-D gels, and requires a substantial commitment in tech- 
nology and resources. 

Given the long-term effort required to develop a protein da- 
tabase, the choice of a biological system takes on consider- 
able importance. While in vitro systems are ideal for answer- 
ing many experimental questions, especially in cancer re- 
search and genetics, our experience with cell cultures and 
tissue samples suggests that some in vivo approaches could 
have major advantages. In particular, we have noticed that 
liver tissue samples from rats and mice appear to show greater
quantitative reproducibility (in terms of individual protein
expression) than replicate cell cultures. This is perhaps
a natural result of the homeostasis maintained in a com- 
plete animal vs. the well-known variability of cell cultures, 
the latter due principally to differences in reagents (e.g., 
fetal bovine serum), conditions (e.g., pH) and genetic "evo- 
lution" of cell lines while in culture. It is also more difficult 
to generate adequate amounts of protein from cell culture 
systems (particularly with attached cells), forcing the inves- 
tigator to resort to radioisotope-based or silver-based stain- 
detection methods. While these methods are more sensi- 
tive (sometimes much more sensitive) than the Coomassie 
Brilliant Blue (CBB) stain typically used for protein detec- 
tion in "large" protein samples, they are generally more vari- 
able, more labor-intensive and, in the case of radiographic 
methods, may generate highly "noisy" images, due to the 
properties of the films used. By contrast, large protein sam- 
ples can easily be prepared from liver using urea/Nonidet 
P-40 (NP-40) solubilization and stained with CBB, which 
has the advantage of being easily reproducible [8]. Finally, 
there remains the question of the "truthfulness" of many in 
vitro systems as compared to their in vivo analogs; how 
great are the changes caused by the introduction into a culture
and the associated shift to strong selection for growth,
and how do these affect experimental results? Thus
the apparent advantages of in vitro systems, in terms of
experimental manipulation, may be counterbalanced by
other factors relating to 2-D data quality.

There is a second important class of reasons for exploring
the use of an in vivo biological system such as the rat.
Historically there have been two broad approaches to the
mechanistic dissection of biochemical processes in intact
cellular systems: genetics (a search for informative mutants)
and chemical agents (drugs and chemical toxins).
Both approaches help us to understand complex systems
by disrupting some specific functional element and showing
the result. With the development of techniques for
genetic manipulation and cloning, the genetic approach
can be effectively applied either in vitro or in vivo, although
the in vitro route is usually quicker. The chemical approach
can also be applied to either sort of biological system; here,
however, the bulk of consistently acquired information is
on experimental animals (rats and mice). While most biologists
know a short list of compounds having specific, experimentally
useful effects (e.g., inhibitors of protein synthesis,
ionophores, polymerase inhibitors, channel blockers,
nucleotide analogs, and compounds affecting polymerization
of cytoskeletal proteins), there is a much larger number of
interesting chemically-induced effects, most of them
characterized by toxicologists and pharmacologists in rodent
systems. Just as a thorough genetic analysis would involve
saturating a genome with mutations, it is possible to imagine
a saturating number of drugs, the analysis of whose actions
would reveal the complete biochemistry of the cell.
While organized drug discovery efforts usually target specific
desired effects, the nature of the process, with its dependence
on screening large numbers of compounds, necessarily
produces many unanticipated effects. It is therefore
reasonable to suppose that the required broad range of
compounds necessary to achieve "biochemical saturation"
may be forthcoming; in fact, it may already exist among the
hundreds of thousands of compounds that failed to qualify
as drugs.
has been made in the development of mouse, rat and human
hepatocyte culture systems, as well as in precision-cut
tissue slices. Using such an array of techniques, it is possi- 
ble to assemble a matrix of mammalian systems including 
mouse and rat in vivo on one level and mouse, rat and hu- 
man in vitro on a second level, and to compare effects be- 
tween species and between systems. This approach allows 
us to draw informed conclusions regarding the biochemical 
"universality" of biological responses among the mammals, 
and to offer some insight into the validity of in vitro
approaches for toxicological screening. We believe these data
will be necessary if in vitro alternatives are to achieve wide
usage in government-mandated safety testing of drugs,
consumer products and industrial and agricultural chemicals.

A number of interesting studies have been published using 
2-D mapping to examine effects in the rodent liver. A number
of investigators have made use of the technique to
screen for existing genetic variants [8-11] or induced
mutations [12-14], mainly in the mouse. This work builds on the
wealth of genetic information available on the mouse and 
its established position as a mammalian mutation-detec- 
tion system. While some studies of chemical effects have 
been undertaken in the mouse [15-17], most have used the 
rat [18-23]. The examination of the cytochrome P-450 system,
in particular, has been carried out almost exclusively
on the rat [24, 25]. 

These considerations lead us to conclude that rodent liver 
offers the best opportunity to systematically examine an
array of gene regulation systems, and ultimately to build a 
predictive model of large-scale mammalian gene control. 
The basic underlying foundation of such a project is a reli- 
able, reproducible master 2-D pattern of liver, to which on- 
going experimental results can be referred. In this paper, we 
report such a master pattern for the acidic and neutral proteins
of rat liver (pattern F344MST3). In future, this master
will be supplemented by maps of basic proteins and analogous
maps of mouse and human liver.



Among organs, the liver is an obvious choice for the study 
of chemical effects because of its well-known plasticity and 
responsiveness. The brain appears to be quite plastic (e.g. 
[7]) but it is a complicated mixture of cell types requiring
skillful dissection for most experiments. The kidney, while 
quite responsive, also presents a potentially confounding 
mixture of cell types. The liver, by contrast, is made up of 
one predominant cell type which is easy to solubilize: the 
hepatocyte, representing more than 95% of its mass. Most 
importantly, the liver performs many homeostatic func- 
tions that require rapid modulation of gene expression. It 
appears that most chemical agents tested affect gene ex- 
pression in the liver at some dosage (N. Leigh Anderson, 
unpublished observations), an interesting contrast to our 
earlier work with lymphocytes, for example, which seem to 
be much less responsive. Such results conform to the expec- 
tation that cells with a homeostatic, physiological role 
should be more plastic than cells differentiated for a pur- 
pose dependent on the action of a limited number of spe- 
cific genes. 

The liver also allows the parallels between in vitro and in 
vivo systems to be examined in detail. Significant progress 



2 Materials and methods 
2.1 Sample preparation 

Liver is an ideal sample material for most biochemical studies
including 2-D analysis. A sample is taken of approximately
0.5 g of tissue from the apical end of the left lobe of the
liver. Solubilization is effected as rapidly as practical; a
delay of 5-15 min appears to cause no major alteration in
liver protein composition if the liver pieces are kept cold
(e.g., on ice) in the interim. In the solubilization process,
the liver sample is weighed, placed in a glass homogenizer
(e.g., 15 mL Wheaton); 8 volumes of solubilizing solution*

* Solubilizing solution is composed of 2% NP-40, 9 M urea
(analytical grade, e.g., BDH or Bio-Rad), 0.5% dithiothreitol (DTT;
Sigma) and 2% carrier ampholytes (pH 9-11 LKB; these come as a 20%
stock solution, so 2% final concentration is achieved by making the
solution 10% 9-11 Ampholine by volume). A large batch of solubilizer
(several hundred mL) is made and stored frozen at -80 °C in aliquots
sufficient to provide enough for one day's estimated sample preparation
requirement. The solution is never allowed to become warmer
than room temperature at any stage during preparation or
use, since heating of concentrated urea solutions can produce contaminants
that covalently modify proteins, producing artifactual charge
shifts. Once thawed, any unused solubilizer is discarded.
is added (i.e., 4 mL per 0.5 g tissue) and the mixture is homogenized
using first the loose- and then the tight-fitting
glass pestle. This takes approximately 5 strokes with
each pestle and is carried out at room temperature because 
urea would crystallize out in the cold. Once the liver sample 
is thoroughly homogenized in the solubilizer, it is assumed 
that all the proteins are denatured (by the chaotropic effect 
of the urea and NP-40 detergent) and the enzymes inactivated
by the high pH (~9.5). Therefore these samples may
be kept at room temperature until they can be centrifuged 
or frozen as a group (within several hours of preparation). 
The samples are centrifuged for 6 × 10^6 g·min (e.g., 500 000
× g for 12 min using a Beckman TL-100 centrifuge). The
centrifuge rotor is maintained at just below room tempera- 
ture (e.g., 15-20 °C), but not too cold, so as to prevent the 
precipitation of urea. The centrifuge of choice is a Beckman 
TL-100 because of the sample tube sizes available, but any 
ultracentrifuge accepting smallish tubes will suffice. When 
an appropriate centrifuge is not available near the site of 
sample preparation, samples can be frozen at -80 °C and 
thawed prior to centrifugation and collection of superna- 
tants. Each supernatant is carefully removed following cen- 
trifugation and aliquoted into at least 4 clean tubes for stor- 
age. This is done by transferring all the supernatant to one 
clean tube, mixing this gently (to assure homogeneous 
composition) and then dividing it into 4 aliquots. The ali- 
quots are frozen immediately at -80°C. These multiple ali- 
quots can provide insurance against a failed run or a freezer 
breakdown. 
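
The arithmetic behind the sample preparation can be sketched as follows. This is a minimal illustration, not part of the published protocol: the function names are hypothetical, 1 g of tissue is treated as roughly 1 mL when reckoning "volumes", and the alternative rotor speed in the last line is an invented example of how the same integrated g·min could be reached on a slower centrifuge.

```python
def solubilizer_volume_ml(tissue_g, volumes=8.0):
    """Solubilizing solution to add, assuming 1 g of tissue is about 1 mL."""
    return tissue_g * volumes


def g_minutes(rcf_g, minutes):
    """Integrated centrifugal dose: relative centrifugal force times run time."""
    return rcf_g * minutes


def minutes_for_dose(target_g_min, rcf_g):
    """Run time needed to reach a target g·min at a given relative centrifugal force."""
    return target_g_min / rcf_g


# 0.5 g of liver with 8 volumes of solubilizer
print(solubilizer_volume_ml(0.5))

# 500 000 x g for 12 min, as in the text
print(g_minutes(500_000, 12))

# Hypothetical slower rotor: time needed at 100 000 x g for the same dose
print(minutes_for_dose(6e6, 100_000))
```

Expressing the spin as a g·min product makes it straightforward to substitute another ultracentrifuge, provided the rotor temperature is still kept high enough to avoid urea precipitation.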

2.2 Two-dimensional electrophoresis

Sample proteins are resolved by 2-D electrophoresis using 
the 20 X 25 cm Iso-Dalt® 2-D gel system ([26-29]; pro- 
duced by LSB and by Hoefer Scientific Instruments, San 
Francisco) operating with 20 gels per batch. All first-dimen- 
sional isoelectric focusing (IEF) gels are prepared using the 
same single standardized batch of carrier ampholytes 
(BDH 4-8A in the present case, selected by LSB's batch-testing
program for rat and mouse database work**). A 10 µL
sample of solubilized liver protein is applied to each gel,
and the gels are run for 33 000 to 34500 volt-hours using a 
progressively increasing voltage protocol implemented by 
a programmable high-voltage power supply. An Ange- 
lique™ computer-controlled gradient-casting system (pro- 
duced by LSB) is used to prepare second-dimensional sod- 
ium dodecyl sulfate (SDS) polyacrylamide gradient slab 
gels in which the top 5% of the gel is 11 %T acrylamide, and
the lower 95% of the gel varies linearly from 11 %T to 18 %T.
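
The gradient profile described above can be written as a simple piecewise function. The sketch below is only an illustration of that description; the function name and the use of a fractional position (0 at the top of the slab, 1 at the bottom) are assumptions and do not represent the Angelique casting program.

```python
def percent_t(frac_from_top, plateau=0.05, t_top=11.0, t_bottom=18.0):
    """Nominal acrylamide concentration (%T) at a fractional depth in the slab.

    frac_from_top: 0.0 at the top edge of the gel, 1.0 at the bottom edge.
    The top `plateau` fraction is held at t_top; below it the concentration
    rises linearly to t_bottom.
    """
    if not 0.0 <= frac_from_top <= 1.0:
        raise ValueError("frac_from_top must lie between 0 and 1")
    if frac_from_top <= plateau:
        return t_top
    rel = (frac_from_top - plateau) / (1.0 - plateau)
    return t_top + rel * (t_bottom - t_top)


for depth in (0.0, 0.05, 0.525, 1.0):
    print(depth, round(percent_t(depth), 2))
```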

This system has recently been modified so as to employ a 
commercially available 30.8 %T acrylamide/N,N'-methylenebisacrylamide
prepared solution (thus avoiding the handling
of the solid acrylamide monomer) and three additional
stock solutions: buffer (made from Sigma pre-set
Tris), persulfate and N,N,N',N'-tetramethylethylenediamine
(TEMED). Each gel is identified by a computer-printed
filter paper label polymerized into the lower left corner
of the gel. First-dimensional IEF tube gels are loaded

** This material (succeeding certified batches of which are available from 
Hoefer Scientific Instruments) has the most linear pH gradient pro- 
duced by any ampholyte tested except for the Pharmacia wide range 
(which has an unacceptable tendency to bind high-molecular weight 
acidic proteins, causing them to streak). 



directly (as extruded) onto the slab gels without equilibra- 
tion, and held in place by polyester fabric wedges (Wed- 
gies™, produced by LSB) to avoid the use of hot agarose. 
Second-dimensional slab gels are run overnight, in groups 
of 20, in cooled DALT tanks (10°C) with buffer circulation. 
All run parameters, reagent source and lot information,
and notations of deviation from expected results are ente- 
red by the technician responsible on a detailed, multi-page 
record of the experiment. 

2.3 Staining 

Following SDS-electrophoresis, slab gels are stained for 
protein using a colloidal Coomassie Blue G-250 procedure 
in covered plastic boxes, with 10 gels (totalling approxima- 
tely 1 L of gel) per box. This procedure (based on the work 
of Neuhoff [30, 31]) involves fixation in 1.5 L of 50% ethanol
and 2% phosphoric acid for 2 h, three 30 min washes,
each in 2 L of cold tap water, and transfer to 1.5 L of 34% 
methanol, 17% ammonium sulfate and 2 % phosphoric acid 
for 1 h, followed by the addition of a gram of powdered Coo- 
massie Blue G-250 stain. Staining requires approximately 4 
days to reach equilibrium intensity, whereupon gels are 
transferred to cool tap water and their surfaces rinsed to re- 
move any particulate stain prior to scanning. Gels may be 
kept for several months in water with added sodium azide. 
The water washes remove ethanol that would dissolve the 
stain (and render the system noncolloidal, with high back- 
grounds). The concentrated ammonium sulfate and meth- 
anol solution is diluted by equilibration with the water vol- 
ume of the gels to automatically achieve the correct final 
concentrations for colloidal staining. Practical advantages 
of this staining approach can be summarized as follows: (i) 
the low, flat background makes computer evaluation of 
small spots (max OD < 0.02) possible, especially when 
using laser densitometry; (ii) up to 1500 spots can be reli- 
ably detected on many gels (e.g., rat liver) at loadings low 
enough to preserve excellent resolution; and (iii) reprodu- 
cibility appears to be very good: at least several hundred 
spots have coefficients of reproducibility less than 15%. 
This value is at least as good as previous CBB methods, and 
significantly better than many silver stain systems. 
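
The dilution that brings the stain solution to its working concentrations can be estimated with a short calculation. This is a rough sketch under stated assumptions: the gel volume (about 1 L per box) is treated as pure water after the washes, volumes are taken as additive, and equilibration is assumed to be complete. The function name is hypothetical.

```python
def equilibrated_percent(stock_percent, solution_l=1.5, gel_l=1.0):
    """Concentration after the staining solution equilibrates with the gel water."""
    return stock_percent * solution_l / (solution_l + gel_l)


# Stock concentrations quoted in the staining procedure
stock = {"methanol": 34.0, "ammonium sulfate": 17.0, "phosphoric acid": 2.0}

for name, percent in stock.items():
    print(f"{name}: {equilibrated_percent(percent):.1f}%")
```

Under these assumptions the box ends up at roughly 20% methanol, 10% ammonium sulfate and 1.2% phosphoric acid, which illustrates why the stock solution is made more concentrated than the intended final staining conditions.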

2.4 Positional standardization 

The carbamylated rabbit muscle creatine phosphokinase 
(CPK) standards [32] are purchased from Pharmacia and 
BDH. Amino acid compositions, and numbers of residues 
present in proteins used for internal standardization, are 
taken from the Protein Identification Resource (PIR) se- 
quence database [33]. 

2.5 Computer analysis 

Stained slab gels are digitized in red light at 134 micron re- 
solution, using either a Molecular Dynamics laser scanner 
(with pixel sampling) or an Eikonix 78/99 CCD scanner. 
Raw digitized gel images are archived on high-density DAT 
tape (or equivalent storage media) and a greyscale video- 
print prepared from the raw digital image as hard-copy 
backup of the gel image. Gels are processed using the Kep- 
ler® software system (produced by LSB), a commercially 
available workstation-based software package built on 
some of the principles of the earlier TYCHO system [34-41].
Procedure PROC008 is used to yield a spotlist giving
position, shape and density information for each detected 
spot. This procedure makes use of digital filtering, mathe- 
matical morphology techniques and digital masking to re- 
move the background, and uses full 2-D least-squares opti- 
mization to refine the parameters of a 2-D Gaussian shape 
for each spot. Processing parameters and file locations are 
stored in a relational database, while various log files detailing
operation of the automatic analysis software are archived
with the reduced data. The computed resolution and
level of Gaussian convergence of each gel are inspected 
and archived for quality control purposes. 

Experiment packages are constructed using the Kepler ex- 
periment definition database to assemble groups of 2-D 
patterns corresponding to the experimental groups (e.g., 
treated and control animals). Each 2-D pattern is matched 
to the appropriate "master" 2-D pattern (pattern 
F344MST3 in the case of Fischer 344 rat liver), thereby 
providing linkage to the existing rodent protein 2-D data- 
bases. The software allows experiments containing hun- 
dreds of gels to be constructed and analyzed as a unit, with 
up to 100 gels displayed on the screen at one time for com- 
parative purposes and multiple pages to accommodate ex- 
periments of > 1000 gels. For each treatment, proteins 
showing significant quantitative differences vs. appropriate 
controls are selected using group-wise statistical parame- 
ters (e.g., Student's t-test, Kepler® procedure STUDENT). 
Proteins satisfying various quantitative criteria (such as P< 
0.001 difference from appropriate controls) are repre- 
sented as highlighted spots onscreen or on computer-plot- 
ted protein maps and stored as spot populations (i.e., logi- 
cal vectors) in a liver protein database. Quantitative data 
(spot parameters, statistical or other computed values) are 
stored as real-valued vectors in the database. Analysis of coregulation
is performed using a Pearson product-moment
correlation (Kepler procedure CORREL) to determine
whether groups of proteins are coordinately regulated by
any of the treatments. Such groups can be presented graphically
on a protein map, and reported together with the statistical
criteria used to assess the level of coregulation. Multivariate
statistical analysis (e.g., principal components analysis)
is performed on data exported to SAS (SAS Institute).
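
The coregulation test can be illustrated with a plain product-moment correlation across gels. The sketch below is not the Kepler CORREL procedure, whose internals are not described here; the function names, the 0.95 cut-off used as a default, and the abundance values are all hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length abundance vectors."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two vectors of equal length >= 2")
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant vector")
    return sxy / sqrt(sxx * syy)


def coregulated(reference, candidates, threshold=0.95):
    """Names of candidate spots whose correlation with the reference meets the threshold."""
    return [name for name, values in candidates.items()
            if pearson_r(reference, values) >= threshold]


# Invented abundances of a reference spot and two candidates over five gels
reference = [1.0, 2.0, 3.0, 4.0, 5.0]
candidates = {
    "spot_a": [2.0, 4.0, 6.0, 8.0, 10.5],   # rises with the reference
    "spot_b": [5.0, 4.0, 3.0, 2.0, 1.0],    # inverse pattern
}
print(coregulated(reference, candidates))
```

A spot with an inverse pattern gives a correlation near -1, so the same function can be used to find inversely regulated groups by testing against a negative threshold.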

2.6 Graphical data output 

Graphical results are prepared in GKS and translated 
within Kepler® into output for any of a variety of devices. 
Line-drawing output is typically prepared as PostScript and
printed on an Apple LaserWriter. Detailed maps presented
here have been generated using an ultra-high-resolution
PostScript-compatible Linotronic output device. Greyscale
graphics are reproduced from the workstation screen using 
a Seikosha videoprinter. Patterns are shown in the standard 
orientation, with high molecular mass at the top and acidic 
proteins to the left. 

2.7 Experiment LSBC04 

In the study described here 12-week-old Charles River 
male F344 rats were used. Diets were prepared at LSB, 
based on a Purina 5755M Basal Purified Diet. Lovastatin 
and cholestyramine were obtained as prescription pharma- 



ceuticals, ground and mixed with the diet at concentrations 
of 0.075% and 1%, respectively. The high cholesterol diet 
was Purina 5801M-A (5% cholesterol plus 1% sodium cholate
in the control diet). Animal work was carried out by
Microbiological Associates (Bethesda, MD). Animals were
acclimatized for one week on the control diet, fed test or
control diets for one week, and sacrificed on day 8. Average
daily doses of lovastatin and cholestyramine in appropriate 
groups were 37 mg/kg/day and 5 g/kg/day, respectively, 
based on the weight of the food consumed. Liver samples 
were collected and prepared for 2-D electrophoresis accord- 
ing to the standard liver protocol (homogenization in 8 
volumes of 9 M urea, 2% NP-40, 0.5% dithiothreitol, 2%
LKB pH 9-11 carrier ampholytes, followed by centrifuga- 
tion for 30 min at 80000 X g). Kidney, brain and plasma 
samples were frozen. Gels were run as described above, 
and the data were analyzed using the Kepler® system. Gels
were scaled, to remove the effect of differences in protein 
loading, by setting the summed abundances of a large num- 
ber of matched spots equal for each gel (linear scaling). 
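
Linear scaling of this kind can be sketched in a few lines. The example below is an illustration only: the data layout (a dictionary of spot abundances per gel), the function names, the choice of the mean reference sum as the common target, and all abundance values are assumptions.

```python
def scale_factors(gels, reference_spots, target=None):
    """One multiplicative factor per gel, equalizing the summed reference abundances."""
    sums = {gel: sum(spots[s] for s in reference_spots)
            for gel, spots in gels.items()}
    if target is None:
        # Assumed convention: scale every gel to the mean reference sum
        target = sum(sums.values()) / len(sums)
    return {gel: target / total for gel, total in sums.items()}


def apply_scaling(gels, factors):
    """Multiply every spot abundance on each gel by that gel's factor."""
    return {gel: {spot: value * factors[gel] for spot, value in spots.items()}
            for gel, spots in gels.items()}


# Invented abundances: the "treated" gel carries twice the protein load
gels = {
    "control": {101: 10.0, 102: 20.0, 413: 2.0},
    "treated": {101: 20.0, 102: 40.0, 413: 30.0},
}
reference_spots = [101, 102]          # matched spots assumed to be unregulated

factors = scale_factors(gels, reference_spots)
scaled = apply_scaling(gels, factors)
print(factors)
print(scaled["control"][413], scaled["treated"][413])
```

After scaling, the remaining difference in a spot outside the reference set reflects regulation rather than loading. In practice the reference set should be large, as the text specifies, so that a few regulated spots do not distort the factors.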



3 Results and discussion 

3.1 The rat liver protein 2-D map 

F344MST3 is a standard 2-D pattern of rat liver proteins, 
based on the Fischer 344 strain. This pattern was initiated 
from a single 2-D gel and extensively edited in an experi- 
ment comparing it to a range of protein loads, so as to in- 
clude both small spots and well-resolved representations of 
high-abundance spots. More than 700 rat liver 2-D patterns 
have been matched to F344MST3 in a series of drug effects 
and protein characterization experiments, and numerous 
new spots (induced by specific drugs, for instance) have 
been added as a result. A modified version including addi- 
tional spots present in the Sprague-Dawley outbred rat has 
also been developed (data not shown). Figure 1 shows a 
greyscale representation and Fig. 2 a schematic plot of the 
master pattern. More than 1200 spots are included, most of 
which are visible on typical gels loaded with 10 µL of solubilized
liver protein prepared by the standard method and
stained with colloidal Coomassie Blue. Master spot num- 
bers (MSN's) have been assigned to all proteins, and ap- 
pear in the following figures, each showing one quadrant of 
the pattern. Figure 3 shows the upper left (acidic, high 
molecular mass) quadrant, Fig. 4 the upper right (basic, 
high molecular mass) quadrant, Fig. 5 the lower left (acidic, 
low molecular mass) quadrant, and Fig. 6 the lower right 
(basic, low molecular mass) quadrant. The quadrants over- 
lap as an aid to moving between them. The gel position (in 
100 micron units), isoelectric point (relative to the CPK internal
pI standards) and SDS molecular mass (from the calibration
curve in Fig. 8) are listed for each spot (Table 1). Because
of the precision of the CPK-pI values, these parameters
can be used to relate spot locations between gel systems
more reliably than using pI measurements expressed
as pH. A major objective of current studies is the identifica- 
tion of all major spots corresponding to known liver pro- 
teins, as well as rigorous definitions of subcellular orga- 
nelle contents. Of particular interest to us is the parallel de- 
velopment of identifications in the rat and mouse liver 
maps, allowing detailed comparisons of gene expression ef- 
fects in the two systems. The results of these studies will be 
presented systematically in a later edition of this database, 
but we include here a useful series of 22 orienting identifications
as an aid to other users of the rat liver pattern (Table
2).

3.2 Carbamylated charge standards, computed pIs and
molecular mass standardization 

We have previously shown that the use of a system of closely-spaced
internal pI markers (made by carbamylating a
basic protein) offers an accurate and workable solution to
the problem of assigning positions in the pI dimension [32].
The same system, based on 36 protein species made by
carbamylating rabbit muscle CPK, has been used here to assign
pIs to most rat liver acidic and neutral proteins. The
standards were coelectrophoresed with total liver proteins,
and the standard spots added to a special version of the
master pattern F344MST3. The gel x-coordinates of all
liver protein spots lying within the CPK charge train were
then transformed into CPK pI positions by interpolation
between the positions of immediately adjacent standards
(Table 1) using a Kepler® vector procedure.
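
The interpolation step can be sketched as follows. This is not the Kepler vector procedure itself: the function name is hypothetical, the standards are represented as (gel coordinate, CPK position) pairs, and the coordinates in the example are invented. Spots outside the charge train are left unassigned.

```python
from bisect import bisect_right


def cpk_pi(x, standards):
    """Interpolate a CPK pI position for a spot at gel coordinate x.

    standards: (gel_coordinate, cpk_position) pairs sorted by strictly
    increasing coordinate. Returns None for spots outside the charge train.
    """
    if len(standards) < 2:
        raise ValueError("at least two standards are required")
    xs = [coord for coord, _ in standards]
    if not xs[0] <= x <= xs[-1]:
        return None
    i = bisect_right(xs, x)
    if i == len(xs):                      # x coincides with the last standard
        return float(standards[-1][1])
    (x0, p0), (x1, p1) = standards[i - 1], standards[i]
    return p0 + (x - x0) * (p1 - p0) / (x1 - x0)


# Invented positions of three adjacent members of the charge train
standards = [(500.0, -12), (560.0, -11), (630.0, -10)]

print(cpk_pi(530.0, standards))   # halfway between the -12 and -11 standards
print(cpk_pi(630.0, standards))   # exactly on a standard
print(cpk_pi(400.0, standards))   # outside the train
```

Because each spot is located only relative to its two nearest standards, local distortions of the pH gradient affect the standards and the sample spots alike and largely cancel.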

It has proven possible to compute fairly accurate pI values
for many proteins from the amino acid composition [42].
We have attempted here to test a further elaboration of this
approach, in which we computed pIs for the CPK standards
themselves, based on our knowledge of the rabbit muscle
CPK sequence and the fact that adjacent members of the
charge train typically differ by blockage of one additional
lysine residue (Table 3). We compared these values to similar
computed pIs for an additional set of carbamylated standards
made from human hemoglobin beta chains and a series
of rat liver and human plasma proteins of known position
and sequence (Fig. 7, Table 4). The result demonstrates
good concordance between these systems. Two proteins
show significant deviations: liver fatty-acid binding protein
(FABP; #1 in Table 4) and protein disulphide isomerase
(#20 in the table). The FABP spot present on F344MST3
may represent a charge-modified version of a more basic
parent spot closer to the expected pI, not resolved in the
IEF/SDS gel. Of particular importance is the fact that, by
comparing computed pIs of sequenced but unlocated proteins
with the CPK pIs, we can assign a probable gel location
without making any assumptions regarding the actual
gel pH gradient. This offers a useful shortcut, given the
vagaries of pH measurement on small diameter IEF gels. We
have used this approach to compute the CPK pIs of all rat
and mouse proteins in the PIR sequence database, as an aid
to protein identification (data not shown).

In order to standardize SDS molecular weight (SDS-MW), 
we have used a standard curve fitted to a series of identified 
proteins (Fig. 8). Rather than using molecular mass per se,
we have elected to use the number of amino acids in the 
polypeptide chain, as perhaps a better indication of the 
length of the SDS-coated rod that is sieved by the second 
dimension slab. The resulting values were multiplied by 
112 (the weighted average mass of amino acids in se- 
quenced proteins) to give predicted molecular masses. Be- 
cause we use gradient slabs, we have not constrained the fit- 
ted curve to conform to any predetermined model; rather 
we tried many equations and selected the best using the 
program "Tablecurve" on a PC. The equation chosen was y
= a + bx + c/x², where y is the number of residues, x is the gel
y-coordinate, a is 511.83, b is -0.2731 and c is 33183801. The
resulting fit appears to be fairly good over a broad range of 
molecular mass. 
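
The calibration can be evaluated numerically as sketched below. An important caveat: the third term of the fitted equation is not legible in the available text, and the form c/x² used here is an assumption, chosen because it is the only simple form that gives plausible residue counts with the quoted coefficients. The gel coordinate is taken to be in the 100 micron units used for spot positions, and the function names are hypothetical.

```python
A = 511.83
B = -0.2731
C = 33183801.0
MEAN_RESIDUE_MASS = 112.0   # weighted average residue mass quoted in the text


def residues_from_position(x):
    """Predicted chain length for a spot at gel y-coordinate x.

    ASSUMPTION: the third term is taken as C / x**2; the printed equation
    is garbled in the source, so this form is a reconstruction.
    """
    if x <= 0:
        raise ValueError("gel coordinate must be positive")
    return A + B * x + C / x ** 2


def predicted_mass(x):
    """Predicted SDS molecular mass (Da) from the predicted residue count."""
    return residues_from_position(x) * MEAN_RESIDUE_MASS


for coord in (500.0, 1000.0, 1500.0):
    print(coord, round(residues_from_position(coord)), round(predicted_mass(coord)))
```

With these coefficients the predicted chain length falls steadily with increasing coordinate, as expected for a slab run with high molecular mass at the top; a spot at coordinate 1000 comes out at about 272 residues, or roughly 30.5 kDa.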

3.3 An example of rat liver gene regulation: Cholesterol 
metabolism 

Experiment LSBC04 was designed as a small-scale test of 
the regulation of cholesterol metabolism in vivo by three 
agents included in the diet: lovastatin (Mevacor®, an inhibi- 
tor of HMG-CoA reductase); cholestyramine (a bile acid 
sequestrant that has the effect of removing cholesterol 
from the gut-liver recirculation); and cholesterol itself. The 
first two agents should lower available cholesterol and the 
third should raise it, allowing manipulation of relevant 
gene expression control systems in both directions. Such 
an experiment offers an interesting test of the 2-D mapping 
system since most of the pathway enzymes are present in 
low abundance, many are membrane-bound and difficult 
to solubilize, and the pathway itself is complex. Approxima- 
tely 1000 proteins were separated and detected in liver ho- 
mogenates. Twenty-one proteins were found to be affected 
by at least one treatment, and these could be divided into 
several coregulated groups. 

3.3.1 MSN 413 (putative cytosolic HMG-CoA synthase)
and sets of spots regulated coordinately or inversely

One group of spots (including a spot assigned to the cytosolic
HMG-CoA synthase, MSN 413) showed the expected
increase in abundance with lovastatin or cholestyramine, 
the synergistic further increase with lovastatin and choles- 
tyramine, and a dramatic decrease with the high cholesterol 
diet. Spot number 413 is the most strongly regulated pro- 
tein in the present experiment, showing a 5- to 10-fold in- 
duction after a 1 week treatment with 0.075 % lovastatin and 
1% cholestyramine in the diet (Figs. 9 and 10). Its expres- 
sion follows precisely the expectation for an enzyme whose 
abundance is controlled by the cholesterol level; it is pro- 
gressively increased from the control levels by cholestyra- 
mine, lovastatin and lovastatin plus cholestyramine, and it 
sinks below the threshold of detection in animals fed the 
high cholesterol diet. This spot has been tentatively identi- 
fied as the cytosolic HMG-CoA synthase, based on a reac- 
tion with an antiserum to that protein provided by Dr. Mi- 
chael Greenspan at Merck Sharp & Dohme Research Labo- 
ratories. This enzyme lies immediately before HMG-CoA 
reductase in the liver cholesterol biosynthesis pathway, and 
is known to be co-regulated with it. Spot 413 has an SDS 
molecular weight of about 54 000 and a CPK pI of -11.4, in
reasonably close agreement with a molecular weight of
57 300 and a CPK pI of -15.7 computed from the known
sequence of the hamster enzyme [43].

Using a classical product-moment correlation test (Kepler 
procedure CORREL), a series of five additional spots was 
found to be coregulated with 413. The level of correlation 
was exceedingly high (> 95%). Two of these, 1250 and 933, 
are at similar molecular weights and approximately one 
charge more acidic than 413 (Fig. 9), indicating that they 
may be covalently modified forms of the 413 polypeptide. 
This suspicion is strengthened by the observation that both 
spots are also stained by the antibody to cytosolic HMG- 
CoA synthase. The remaining three correlated spots appear 
to comprise an additional related pair (1253 and 1001) of
around 40 kDa and a single spot (1119) of around 28 kDa.
Because these two presumed proteins are present at
substantially lower abundances than 413, and because the
cytosolic HMG-CoA synthase is reported to consist of only one
type of polypeptide, they are likely to represent other, very
tightly coregulated enzymes. A second group of six spots
was selected based on a regulatory pattern close to the
inverse of that for spot 413 (MSNs 34, 79, 178, [...];
data not shown). For these proteins, the lowest level of
expression occurs with exposure to lovastatin plus
cholestyramine and the highest level upon exposure to the
high-cholesterol diet. Spots 182 and 79 are highly correlated
and lie about one charge apart at the same molecular weight;
they may thus be isoforms of a single protein. The other four
spots probably represent additional enzymes or subunits.
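
The coregulation search described above can be illustrated with a minimal sketch. It assumes that the "> 95%" criterion corresponds to a product-moment correlation coefficient above 0.95 between the abundance profile of a reference spot and that of each candidate spot; the internals of the Kepler CORREL procedure are not described here, so the function names, the threshold and the abundance values below are hypothetical and serve only to show the calculation.

```python
import math


def pearson_r(x, y):
    """Product-moment correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant profile has no defined correlation")
    return sxy / math.sqrt(sxx * syy)


def coregulated(reference, candidates, threshold=0.95):
    """Return the ids of candidate spots whose profile correlates with the
    reference profile above the (assumed) threshold."""
    return [spot for spot, profile in candidates.items()
            if pearson_r(reference, profile) > threshold]


# Hypothetical abundances (one value per gel), NOT data from this study.
reference_profile = [1.0, 2.0, 3.0, 4.0, 5.0]
candidate_profiles = {
    "A": [2.1, 3.9, 6.2, 8.0, 9.9],   # rises with the reference
    "B": [5.0, 4.0, 3.0, 2.0, 1.0],   # exact inverse of the reference
    "C": [3.0, 1.0, 4.0, 1.0, 5.0],   # weakly related
}
print(coregulated(reference_profile, candidate_profiles))
```

A pattern "close to the inverse" of the reference, as for the second group of spots, would appear in such a scheme as a coefficient near -1 rather than +1.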

3.3.2 MSN 235 and coregulated spots 

A third group of five spots, mainly comprised of
mitochondrial proteins including putative mitochondrial
HMG-CoA synthase spots, showed a modest induction by
lovastatin alone, but little or no effect with any of the other
treatments (including the combination of lovastatin and
cholestyramine; Fig. 12). This result is intriguing because
lovastatin was expected to affect only the regulation of
enzymes of cholesterol synthesis, which is entirely
extra-mitochondrial. Three of the spots (235, 134, 144) form a
closely packed triad at approximately 30 kDa, and are likely
to represent isoforms of one protein. All three spots are
stained by an antibody to the mitochondrial form of HMG-CoA
synthase obtained from Dr. Greenspan. Subcellular
fractionation indicates a mitochondrial location. The other two
spots (633 at about 38 kDa and 724 at about 69 kDa) are
each present at lower abundance than the members of the
triad.



3.3.3 An example of an anti-synergistic effect

A sixth spot (367) shows strong induction by lovastatin
(two- to threefold), and about half as much induction with
lovastatin plus cholestyramine, but without sharing the
animal-to-animal heterogeneity pattern of the 235-set (Fig. 13).
This protein is also mitochondrial, and represents the
clearest example of an anti-synergistic effect of lovastatin and
cholestyramine. The existence of such an effect
demonstrates that lovastatin and cholestyramine do not act
exclusively through the same regulatory pathway.

3.3.4 Complexity of the cholesterol synthesis pathway

Taken together, these results suggest that treatment with
lovastatin alone can affect both cytosolic and mitochondrial
pathways using HMG-CoA, while cholestyramine, on the
other hand, either alone or in combination with lovastatin,
produces a strong effect on the putative cytosolic pathway,
but little or no effect on the putative mitochondrial
pathway. An explanation for this difference may lie in
lovastatin's effect on levels of HMG-CoA and related precursor
compounds that are exchanged between the cytosol and
the mitochondrion, whereas cholestyramine should affect
only the cytosolic pathways directly controlled by
cholesterol and bile acid levels. It remains to be explained why some
proteins of the putative mitochondrial pathway are so
much more variable in their expression in all groups. An
examination of all the coregulated groups suggests that
quantitative statistical techniques can extract a wealth of
interesting information from large sets of reproducible gels. The
abundances of spots in the 413 coregulation group, for
example, show a remarkable level of concordance in their relative
expression among the five individuals of the lovastatin and
cholestyramine treatment group. This effect is not due to
differences in total protein loading, since they have already
been removed by scaling, and since proteins with quite
different regulation patterns can be demonstrated (e.g., Fig.
13). Such effects raise the possibility that many gene
coregulation sets may be revealed through the study of a
sufficiently large population of control animals (i.e., without
any experimental manipulation). This approach, exploiting
natural biological variation in protein expression instead of
drug effects, offers an important incentive for the
construction of a large library of control animal patterns.

4 Conclusions

Because of the widespread use of rat liver in both basic
biochemistry and toxicology, there is a long-term need for a
comprehensive database of liver proteins. The rat liver
master pattern presented here has proven to be an accurate
representation of this system, having been matched to more
than 700 gels to date. As the number of proteins identified
and the number of compounds tested for gene expression
effects grow, we expect this database to contribute
valuable insights into gene regulation. Its practical utility in
several areas of mechanistic toxicology is already being
demonstrated.

Received September 11, 1991

Synthetic representation of the standard rat liver 2-D master pattern, rendered as a greyscale image using a
Figure 7. (a) Plot of computed isoelectric point versus gel x-position for
two sets of carbamylated standard proteins (rabbit muscle CPK [+] and
human hemoglobin β chain, filled diamonds) and several other proteins
(shaded squares). (b) The identities of the various proteins represented
by the squares are indicated by the numbers in corresponding positions
on (a); these refer to Table 4.
Figure 9. Montage showing effects in the
region of MSN:413. The montage shows a
small window into one portion of the 2-D
pattern, one row of windows for each
experimental group, and one panel for each gel
in the experiment. The left-most pattern
in each row is a group-specific copy of the
master pattern, followed by the patterns
for the five individual rats in the group.
The highlighted protein spots (filled
circles) are spot 413 (on the right of each
panel; identified as cytosolic HMG-CoA
synthase) and two modified forms of it (1250
and 933). From the top, the rows
(experimental groups) are: high cholesterol,
controls, cholestyramine, lovastatin, and
lovastatin plus cholestyramine.
Figure 10. Bargraph showing the quantitative
effects of various treatments on the
abundance of MSN:413 (cytosolic HMG-CoA
synthase) in the gels of Fig. 9. Panel title:
"Regulation of Rat Liver 413 (Putative
Cytosolic HMG-CoA Synthase, 53 kDa)".




Figure 11. Bargraphs of a series of six
coregulated spots including MSN:413. In the
bargraphs, the abundances of the
appropriate spot (master spot number shown at
the top of the panel) in each animal are
shown. The five five-animal groups are in
the order (left to right): high cholesterol,
controls, cholestyramine, lovastatin, and
lovastatin plus cholestyramine. Each bar
within a group represents one
experimental animal liver (one 2-D gel). Note the
correlated expression of the six spots,
especially in the two far right (most strongly
induced) groups.
Figure 12. Data on a second coregulated
group of spots, presented as in Fig. 11. The
fourth experimental group (lovastatin)
shows a modest induction, while the fifth
group (lovastatin plus cholestyramine)
does not.




Figure 13. Data on spot MSN:367, presented as in Fig. 11. This protein
shows unambiguously the anti-synergistic effect of lovastatin and
cholestyramine (fifth group) as compared to lovastatin (fourth group). This
response contrasts strongly with the regulation pattern seen in Fig. 11.
a) Master table of proteins in the rat liver database, showing spot master number, gel position (x and y), isoelectric point relative to CPK standards,
and predicted molecular mass (from the standard curve of Fig. 8).
Table 3. Computed pI's of two sets of carbamylated protein standards: rabbit muscle CPK and human
hemoglobin (Hb)

                          PIR      #ASP  #GLU  #HIS  #LYS  #ARG  NH2-  Calc   Real
     Protein Name         Name      3.9   4.1   6.0  10.8  12.5   7.0  pI     CPK

  0  Rabbit muscle CPK    KIRBCM     28    27    17    34    18     1  6.84    0.0
 -1                                  28    27    17    33    18     1  6.67   -1
 -2                                  28    27    17    32    18     1  6.54   -2
 -3                                  28    27    17    31    18     1  6.42   -3
 -4                                  28    27    17    30    18     1  6.31   -4
 -5                                  28    27    17    29    18     1  6.21   -5
 -6                                  28    27    17    28    18     1  6.12   -6
 -7                                  28    27    17    27    18     1  6.03   -7
 -8                                  28    27    17    26    18     1  5.94   -8
 -9                                  28    27    17    25    18     1  5.85   -9
-10                                  28    27    17    24    18     1  5.76  -10
-11                                  28    27    17    23    18     1  5.67  -11
-12                                  28    27    17    22    18     1  5.58  -12
-13                                  28    27    17    21    18     1  5.48  -13
-14                                  28    27    17    20    18     1  5.39  -14
-15                                  28    27    17    19    18     1  5.29  -15
-16                                  28    27    17    18    18     1  5.20  -16
-17                                  28    27    17    17    18     1  5.12  -17
-18                                  28    27    17    16    18     1  5.04  -18
-19                                  28    27    17    15    18     1  4.96  -19
-20                                  28    27    17    14    18     1  4.89  -20
-21                                  28    27    17    13    18     1  4.83  -21
-22                                  28    27    17    12    18     1  4.77  -22
-23                                  28    27    17    11    18     1  4.71  -23
-24                                  28    27    17    10    18     1  4.66  -24
-25                                  28    27    17     9    18     1  4.61  -25
-26                                  28    27    17     8    18     1  4.56  -26
-27                                  28    27    17     7    18     1  4.52  -27
-28                                  28    27    17     6    18     1  4.48  -28
-29                                  28    27    17     5    18     1  4.44  -29
-30                                  28    27    17     4    18     1  4.40  -30
-31                                  28    27    17     3    18     1  4.36  -31
-32                                  28    27    17     2    18     1  4.32  -32
-33                                  28    27    17     1    18     1  4.29  -33
-34                                  28    27    17     0    18     1  4.25  -34
-35                                  28    27    17     0    18     0  4.22  -35

  0  Hb-beta, human       HBHU        7     8     9    11     3     1  7.18
 -1                                   7     8     9    10     3     1  6.79
 -2                                   7     8     9     9     3        6.53   -1.8
 -3                                   7     8     9     8     3        6.32   -3.2
 -4                                   7     8     9     7     3        6.13   -5.3
 -5                                   7     8     9     6     3        5.96   -7.2
 -6                                   7     8     9     5     3        5.78  -10.0
...
-12                                                                    4.54  -27.2
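
The "Calc pI" column of Table 3 can be approximated from the residue counts and the pK values in the table header. The sketch below is an illustration under stated assumptions, not the procedure actually used for the table: it treats every listed group as an independent Henderson-Hasselbalch titration site, ignores any group not listed in the header (C-terminus, Tyr, Cys), and finds the pH of zero net charge by bisection. The function names are hypothetical, and the values it returns may differ somewhat from those printed in the table.

```python
# pK values as listed in the header of Table 3.
PK_ACIDIC = {"ASP": 3.9, "GLU": 4.1}
PK_BASIC = {"HIS": 6.0, "LYS": 10.8, "ARG": 12.5, "NH2": 7.0}


def net_charge(counts, ph):
    """Net charge at a given pH, treating each group as an independent site."""
    negative = sum(counts.get(name, 0) / (1.0 + 10.0 ** (pk - ph))
                   for name, pk in PK_ACIDIC.items())
    positive = sum(counts.get(name, 0) / (1.0 + 10.0 ** (ph - pk))
                   for name, pk in PK_BASIC.items())
    return positive - negative


def isoelectric_point(counts, low=0.0, high=14.0, tol=1e-4):
    """pH at which the net charge is zero, located by bisection.

    Net charge falls monotonically as pH rises, so one sign change
    between `low` and `high` brackets the isoelectric point.
    """
    if net_charge(counts, low) <= 0 or net_charge(counts, high) >= 0:
        raise ValueError("isoelectric point is not bracketed by [low, high]")
    while high - low > tol:
        mid = (low + high) / 2.0
        if net_charge(counts, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


# Composition of unmodified rabbit muscle CPK, taken from row 0 of Table 3.
cpk = {"ASP": 28, "GLU": 27, "HIS": 17, "LYS": 34, "ARG": 18, "NH2": 1}

# Each carbamylation step is modeled as the loss of one lysine amino group.
carbamylation_train = []
for lost in range(0, 6):
    modified = dict(cpk, LYS=cpk["LYS"] - lost)
    carbamylation_train.append(isoelectric_point(modified))

for lost, pi_value in enumerate(carbamylation_train):
    print(f"{-lost:3d}  {pi_value:.2f}")
```

Because each step removes one positive charge, the computed pI falls with every carbamylation, which is the behavior that makes such a charge train usable as an internal pI scale.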
Table 4. Computed pI's of some known proteins related to measured CPK pI's

      Protein Name                                          PIR Name

  0   Creatine phosphokinase (CPK), rabbit muscle           KIRBCM
  1   Fatty acid-binding protein, rat hepatic               FZRTL
  2   β2-microglobulin, human                               MGHUB2
  3   Carbamoyl-phosphate synthase, rat                     SYRTCA
  4   Prealbumin (serum albumin precursor), rat             ABRTS
  5   Serum albumin, rat                                    ABRTS
  6   Superoxide dismutase (Cu-Zn, SOD), rat                A26810
  7   Phospholipase C, phosphoinositide-specific (?), rat   A28807
  8   Albumin, human                                        ABHUS
  9   Apo A-I lipoprotein, rat                              A24700
 10   proApo A-I lipoprotein, human                         LPHUA1
 11   NADPH cytochrome P-450 reductase, rat                 RDRT04
 12   Retinol binding protein, human                        VAHU
 13   Actin beta, rat                                       ATRTC
 14   Actin gamma, rat                                      ATRTC
 15   Apo A-I lipoprotein, human                            LPHUA1
 16   Apo A-IV lipoprotein, human                           LPHUA4
 17   Tubulin alpha, rat                                    UBRTA
 18   F1 ATPase beta, bovine                                PWBOB
 19   Tubulin beta, pig                                     UBPGB
 20   Protein disulphide isomerase (PDI), rat hepatic       ISRTSS
 21   Cytochrome b5, rat                                    CBRT5
 22   Apo C-II lipoprotein, human                           LPHUC2

Amino acid pI assumed in calculation:

#ASP  #GLU  #HIS  #LYS  #ARG  Calc  Real
 3.9   4.1   6.0  10.8  12.5  pI    CPK